timtyler comments on [missing post]

timtyler 14 Apr 2012 22:14 UTC
0 points
0

There is a mathematical theorem about self-modifying agents that state an agent will not self modify to invalidate it’s utility function, because if the agent does that, the modified agent will not maximize the current agents utility function.

That’s not correrct. There is no such “mathematical theorem”.

Indeed we know that some agents will wirehead, since we can see things like heroin addicts, hyperinflation and Enron in the real world.
- pedanterrific 14 Apr 2012 22:25 UTC
  −3 points
  0
  Parent
  Humans don’t have utility functions, though.
  
  Edit: Oops. Apparently they do.
  - timtyler 14 Apr 2012 23:14 UTC
    0 points
    0
    Parent
    See: Any computable agent may described using a utility function.
    - pedanterrific 14 Apr 2012 23:33 UTC
      0 points
      0
      Parent
      Sorry, I notice you’ve had this argument at least once before. That’ll learn me to shoot my mouth off. In my defense, the wiki just says “[utility functions] do not work very well in practice for individual humans” without any mention of this fact.
      
      However, I’m still not certain that you can take heroin addicts as proof that some agents self-modify to invalidate their utility functions.